Sentence design for speech synthesis and speech recognition database by phonetic rules
Abstract
This paper describes the processing of 2465 sentences (or utterances) selected by phonetic rules from a large corpus of recent newspaper text, including "People's Daily", as material for a speech recognition and speech synthesis database. The sentences cover both phonetic phenomena and sentence patterns. We first consider the phonetic distribution across syllable boundaries: inter-syllabic diphones, inter-syllabic triphones and final-initial structures. Syllabic balance ensures coverage of intra-syllabic phenomena such as phonemes, initials/finals and consonants/vowels. Roughly 17 kinds of sentence patterns appear in our sentence set. We have also created a set of phonetically balanced 2-4 syllable phrases that includes all of the tone structures.
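The abstract does not state how sentences were chosen beyond "phonetic rules". As a rough illustration only, the sketch below uses a greedy coverage heuristic, which is an assumption and not the authors' documented method. Each sentence is assumed to be a list of hypothetical `(initial, final)` syllable pairs, and an inter-syllabic unit is taken to be the final of one syllable followed by the initial of the next. The names `inter_syllabic_units` and `greedy_select` are invented for this example.

```python
from typing import Dict, List, Set, Tuple

Syllable = Tuple[str, str]   # (initial, final); assumed representation
Unit = Tuple[str, str]       # (final of syllable i, initial of syllable i+1)


def inter_syllabic_units(syllables: List[Syllable]) -> Set[Unit]:
    """Collect the final-initial pairs that span each syllable boundary."""
    return {
        (left[1], right[0])
        for left, right in zip(syllables, syllables[1:])
    }


def greedy_select(
    corpus: Dict[str, List[Syllable]], max_sentences: int
) -> Tuple[List[str], Set[Unit]]:
    """Repeatedly pick the sentence that adds the most uncovered units.

    Stops when the budget is reached or no sentence adds anything new.
    This is a generic greedy heuristic, assumed here for illustration.
    """
    remaining = {sid: inter_syllabic_units(s) for sid, s in corpus.items()}
    covered: Set[Unit] = set()
    chosen: List[str] = []
    while remaining and len(chosen) < max_sentences:
        # sorted() makes tie-breaking deterministic (lowest id wins).
        best = max(sorted(remaining), key=lambda sid: len(remaining[sid] - covered))
        gain = remaining[best] - covered
        if not gain:
            break
        chosen.append(best)
        covered |= gain
        del remaining[best]
    return chosen, covered


# Toy corpus with made-up sentence ids and syllable segmentations.
corpus = {
    "s1": [("n", "i"), ("h", "ao")],
    "s2": [("n", "i"), ("h", "ao"), ("m", "a")],
    "s3": [("zh", "ong"), ("g", "uo")],
}
chosen, covered = greedy_select(corpus, max_sentences=2)
print(chosen, len(covered))
```

The same loop could be extended to triphones or tone patterns by changing what `inter_syllabic_units` returns; the selection logic would stay the same.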
Similar resources
Evolving Phonological Rules Using Grammatical Evolution
Phonetic transcription is a core procedure of continuous speech recognition systems and speech synthesizers. The more accurate the phonetic transcription, the more successful the applications that use it. Phonological rules translate text to graphemes. These rules significantly reduce the size of databases of words and their phonetic transcriptions and speed up the transcription process. T...
Multilingual non-native speech recognition using phonetic confusion-based acoustic model modification and graphemic constraints
In this paper we present an automated approach for non-native speech recognition. We introduce a new phonetic confusion concept that associates sequences of native language (NL) phones to spoken language (SL) phones. Phonetic confusion rules are automatically extracted from a non-native speech database for a given NL and SL using both NL’s and SL’s ASR systems. These rules are used to modify th...
A Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation
Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by enabling natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from the speech signal has become a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...
Design and Implementation of an Intelligent Part of Speech Generator
The aim of this paper is to report on an attempt to design and implement an intelligent system capable of generating the correct part of speech for a given sentence while the sentence is totally new to the system and not stored in any database available to the system. It follows the same steps a normal individual does to provide the correct parts of speech using a natural language processor. It...
Finnish and Estonian Speech Applications Developed on an Object-Oriented Speech Processing and Database System
QuickSig, an object-oriented signal processing system that represents a modern tool with which to perform DSP-related studies, is presented. It empowers speech scientists to operate in a flexible and motivating environment where signals, filters, spectrograms, etc., are all modelled as objects. Seamlessly integrated with QuickSig is an object-oriented database that permits signals along with thei...